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We prove here a version of Bell Theorem that does not assume locality. As a consequence classical 
realism, and not locality, is the common source of the violation by nature of all Bell Inequalities. 



A) Bell Inequalities and Bell Theorems: 

Consider a sequence of spin-i particles, each prepared 
to have positive spin along the arbitrary vector a (hence 
normalized spin s = +1 along a); thus we have a sequence 
{■Si(a) = +1} for the normalized spins of the successive 
identically prepared particles. If one measures the spins 
of the particles so prepared along a vector 6, one obtains a 
sequence s^(6) of normalized spins along b such that sj(6) 
is either +1 or -1 (with probability one). Then Quantum 
Mechanics (QM) teaches us (and experiments confirm) 
that the average value (si(a)|s-(6)) is equal to cos(a, b). 
This is the spin-i version of the classical Malus law for 
polarization (see, e.g., [lj), which may be better known 
in terms of probabilities of coincidences: 

<s l{ d) = = 1 + = cos 2 (M) . 

Following Bohm's treatment |2| of the EPR Paradox 
we deal with electron-positron pairs and study the 
spins of the particles using Measurement Tools (MTs). 
The pairs are prepared in the singlet state (Jl/ ; p. 400) 
whose spin part is rotationally invariant and given along 
any vector by 

%{XI,X 2 ) = ^=(|+}l ® h>2 ~ h)l ® | + ) 2 ) , 

so that the total spin is (we are neglecting here a pos- 
sible phase problem that would not affect the strong in- 
equalities that one obtains at the end). 

One particle of each pair is observed using the MT E, 
and the other using the MT P, these MTs being operated 
by people or smart machines. Fix an MT vector x, i.e., 
a vector orthogonal to the axis along which the particles 
separate. One next measures with X, which stands for E 
or P, the normalized projection of the spin along x of one 
of the 2 elements of the i th pairs from a sequence of pairs 
prepared to be in the singlet state. The measured value 
is denoted by Xi(x), and also denoted by £ j if X = E 
and by Vi if X = P. The total spin zero property of the 
singlet state then reads: 

For all x and all i, E z (x) + P t {x) = £ l +V l = 0. 

A Bell Inequality (BI) is an inequality among some 
number of objects such as the average (Xi(x)\Yi(y)) or 
the probability of coincidence 7r(X;(a?) = Yi(y)). Here 
X and Y each stand for one of E and P while x and y 



belong to a collection of names of fixed or variable MT 
vectors whose size depends on the BI version. 

A Bell Theorem (BT) is (depending on the authors): 
[-] Either the contradiction obtained by replacing the ob- 
jects such as (Xi(x)\Yi(y)) or 7r(X;(a?) = Yi(y)) in a BI 
by their values provided by augmenting QM using the 
collection A of augmentations described next. 
[-] And/Or the implications of such a contradiction. 

The collection A of augmentations of QM: The collec- 
tion of augmentations of QM that can be used to allow 
one to formulate a Bell type argument we call A (for 
short, we will write QMA to mean "QM augmented by 
an element of A" , and often use just A to mean "any ele- 
ment of A" ) . Two of the most extreme and most common 
samples from the collection A are described next: 

- i) At one end of the spectrum of strengths of the aug- 
mentations of QM, A contains the strong hypothesis of 
Bell in Q (see also 0) that there are Hidden Variables 
(HVs) whose statistical properties are the same as for 
standard QM, and which are predictive (i.e., the future 
values of the observables are determined by the present 
state of the world, itself described using all the needed 
variables, including the HVs, even if no one can predict 
nor access all of the values of the HVs). 

- ii) Toward the other end of the spectrum of strengths, 
A contains what we call Classical realism (also called 
contra] 'actual definiteness 6|). According to this weaker 
hypothesis (used, e.g., in |6| and fjj in the BT context), 
the statistical properties are the same as for standard 
QM and (without assuming any predictability) an ob- 
servable has a value that is well defined whenever the 
measurement could be made, even if the measurement 
is not performed (for instance because another measure- 
ment is performed instead). 

A BT deals with QM^4, furthermore assuming locality 
in versions prior to the new one to be presented here, and 
one concludes that at least one of the following choices 
needs to be made: the augmentation gets disqualified to 
avoid the contradiction of that BT, or one uses the fact 
that the formerly known Bis themselves disappear if one 
drops the assumption of locality. 

Too broad a definition of locality could force us to take 
a stand on whether "QM has some nonlocal features" in 
a loose sense. To avoid the need to face such a choice, in 
this paper by locality we mean that if the evaluations of 
the spin projections are spatially separated then the val- 
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ues of the observables near one MT are not influenced by 
the settings of the other MT. More explicitly, by assum- 
ing locality the effect of choosing which measurement is 
made on any particle of a pair could not affect what is 
observed at spatial distance Ax and time difference At 
on the other particle if the observations are spatially sep- 
arated (i.e., c 2 At 2 < Ax 2 where c stands for the speed 
of light). Causality means that no information bearing 
signal can propagate faster than light and it is known 
that causality is not violated by a failure of locality be- 
cause, as proved in 0, correlations between the results 
of measurements on separate subsystems of any quan- 
tum system cannot be used to send signals. The new 
BT derivation proposed here does not use the locality as- 
sumption and thus challenges the usual claim that "BT 
indicates non-locality". 

B) A simple derivation of a BT: 

One of the simplest previous BT derivations has been 
adapted from || and reported by Penrose (as told to 
him by Mermin) in @, [ijj- We present it in two parts 
of which only the first will be used to present the new 
BT. 

- I) We assume henceforth that the particles are fast 
enough for the observations made at E and P on each 
pair to be spatially separated, with E and P fixed in the 
reference frame where the observations are synchronous 
or approximately so. Following || and [Hjj suppose that 
there are: 

- Two MT vectors for E, | -»■) defining 0°, and | f) at 
90° (we use here kets for vectors in physical space: the 
context should tell if a given ket represents such a vector, 
a spin state along that vector, or possibly both entities), 

- Two MT vectors for P, | /) at 45° and | \) at -45°. 

Take the actual settings to be respectively | — >) for 
E and | /*) for P for a first run of experiments, and 
call £ = £\, £2, ■ ■ ■ and V = V\, Vi, ■ ■ ■ the runs re- 
spectively registered by E and P. Then QM tells us 
that the probability n(£i = Vi) that £i and Vi coincide 
is \(\ - cos(45°)) = 0.146..., hence about 0.15. The 
QM prediction comes from (e.g.,) Wave Packet Reduc- 
tion according to which the entangled singlet state be- 
comes the product state ^'(xi, £2) = |+ff)i <8 |— 3)2 OT 
V?'(afi, x 2 ) — |— 0)1 ® |+o)2 after the E measurement 
along a = | — >) is performed. Then Pi(a) = —Ei(a) 
and the conclusion follows from Malus law. 

- II) In the previous versions one next assumes locality 
so that V does not depend on the E setting. Call £' the 
run that would have been registered by E if the alternate 
MT vector | f) had been chosen (assuming ^4) . The prob- 
ability tt(£- = Vi) of agreement between £' and V would 
then have been again equal to ^(1 — cos(45°)) = 0.146.... 
Next, if the E settings had been | — >) as initially de- 
cided, but the P settings had been | \), then the run 
at E would have been £ as before, using locality again. 
Denoting by V' the runs that would have been registered 



by P with the new setting of its MT vector (assuming 
A), the expectation of coincidence ir(£i = V-) between £j 
and V[ would have been again i(l — cos(45 )) = 0.146... . 
The punch line is then the difference between two ways 
of comparing the runs £' and V': 

[-] On the one hand we have the Bell Inequality 
(*) n(£' t = Pi)+ir(Vi = £ i )+n(£ i = V{) > *(£[ = V',) . 

Proof of (*). All entries being binary for £[ and V[ 
to agree requires at least one of the equalities £[ = Vi, 
V L = £i, or £i — Vi to hold true. 

For now we notice that for the angles that we have 
chosen, the Bell Inequality (*) yields the upper bound 
0.45=0.15+0.15+0.15 for the probability n(£< = V[). 

[-] On the other hand computing tt(£1 = V'j) directly 
according to QM.4 yields the equality |(1 — cos(135 )) = 
0.854..., contradicting the former 0.45 upper bound: this 
is the BT's contradiction that we were aiming at. 

The conclusion of Bell's Theorem is then: 
One at least of A or locality is false in order to explain 
why the statistical properties from QM violate the BI. 

It is locality that plays the role of the usual suspect 
(as clearly stated, e.g., by Penrose in p. 589, and by 
Aspect in pp. xxv-xxvi, following a tradition going 
back to Q). It is true that without locality the expres- 
sion 7r(£j' = V[) makes no sense and the conclusion van- 
ishes, but this hardly proves that locality is the essential 
hypothesis, and this paper precisely proves that it is not. 

C) A Bell Theorem with no locality assumption: 

- a) A new BI. 

Only I /) will ever be used at P while we shall con- 
tinue to consider both | — ►) and | |) at E in this section. 
Furthermore we assume that measurements are made of 
V and £ while £' is inferred to make sense, i.e., do not 
depend on choices that have not yet been specified (oth- 
erwise speaking to be well defined and have well defined 
(albeit unknown) values) by using the augmentation of 
QM by A. One could choose instead to not measure ei- 
ther of £ and £' , and use the augmentation of QM by A 
to infer that both of £ and £' have well defined values: 
this will be done in the Appendix (see point a5)) in order 
to focus on a single BT in the main text. Whenever time 
ordering is considered in a Lorentz frame between two 
space-time events, the time that the two signals travel 
at light speed to an observer in that frame is taken into 
account. We will use the following: 

Effect After Cause Principle (EACP): For any 

Lorentz observer and for any X in {£, £', V}, a value 
Xi does not change from any cause that happens after 
Xi has been either measured or inferred to make sense in 
view of QMA for that observer. 



3 



EACP vs locality Lemma. The EACP is different 
from locality. 

See the Appendix for three proofs of this statement. 

Definition. We call an E-P observer a Lorentz observer 
for whom measurements at E occur before measurements 
at P for each pair, while a P-E observer travels in the 
opposite direction. The i?-side sequences depend on Vi 
for P-E observers and our E-P observers will not even 
consider the P side. 

New Bell Inequality. The sum of all three probabilities 
of coincidences for the chosen triple of pairs must be over 
n(Pi = Pi) = I, or 

(**) ir{Vi = Si) + ir(£i = £[) + tt(^ = V t )>l. 

Proof of (**). Collecting the results from an E-P ob- 
server (for n(£i — £■)) and a P-E observer (for the two 
other probabilities), it follows readily from the EACP 
that the three sequences £ , £' , and V involved in (**) 
make sense. All entries being binary, for any Vi to 
agree with itself, one needs at least one of the equalities 
Vi = £i, £i = £■, or £■ = Vi to hold true. 

- b) From the new BI to a BT. 

No Correlation Lemma. The sequences £ and £' are 
not correlated, i.e., 

(o) (£i\£'i) =0 or equivalently n(£i = £' i ) = ^ . 

Proof of the No Correlation Lemma. Using the EACP 
and the conclusion deduced from it in the proof of (**) 
that "the three sequences £, £' , and V involved in (**) 
make sense" , we notice that only the orientation of the 
angle (| — >), | f)) at E could matter for an E-P observer, 
so that (o) follows from invariance under parity without 
assuming locality. 

To see the role of parity, we introduce the further vec- 
tor — | t) to which would correspond the sequence £" such 
that £'( = 1 - £[. Since n(£i = £[) + tt(£ 4 = £'{) = 1 it 
only remains to prove these two probabilities to be equal 
to each other. We use here sequences that are possibly 
unknown, but known to be well defined: 

- one, £, is known by direct measurement, 

- the other £', can be inferred to be unknown but well 
defined by an E-P observer using QMA 

The only thing that could generate inequalities is the dif- 
ference in the orientations of the angles (| — >), | j)) an d 
(| — ►),— | I)). Equality thus follows from parity invari- 
ance. 

New Bell Theorem: We can use the triplet 
(■K{Vi=£i),Tr(£i=Vi),n{£i=£!)) = (0.15,0.15,0.5) 
of probabilities of coincidences in (**) specialized to the 
chosen angles, yielding 0.8 > 1 as the false inequality 
that is the contradiction that we seek. Since no locality 
assumption is used in the derivations of the triplet and 



(**), only the counter-natural character of the gedanken 
experiment (i.e., its dependence upon observable values 
that cannot be measured and are thus inferred to make 
sense assuming A ) can be the cause of the contradiction 
between Quantum Mechanics statistics and the new Bell 
Inequality. 

Proof of the New Bell Theorem. Again we assume the 
EACP (in full compatibility with all experiments made 
so far) and use (**) and the proof of it that is provided 
above. 

- 1) After measurements are made using P, a P-E 
observer obtains that: 

- el) n(Vi = £i) is about 0.15 by QM, or by direct obser- 
vation after measurements are also made using E, 

- e2) it(V l = £[) is about 0.15 by QMA 

The deductions made in el) and e2) using Q1VL4 go as 
follows: By Wave Packet Reduction (for instance), the 
spin state of second particle (the particle on the E side) 
becomes 

*(»2) = \Vi)l®\-Vi) 2 

along the vector | /) (along which the sequence V is mea- 
sured) as soon as the measurement of Vi is made on the 
P side. Hence the second particle gets into a spin state 
prepared to be | — Vi) along the vector | /) (as revealed 
by using the information obtained on the P side) so that 
both of the two probabilities Tr(Vi — £i) and n(Vi = £■) 
are equal to about 0.15 by a simple application of the 
spin-i version of Malus law for polarization as we have 
recalled it, under the assumption found in both of the 
rather extreme elements of A described above that the 
statistics of QM.4 is the same as the statistics of standard 
QM. 

- 2) An E-P observer infers: 

- e3) n(£i = £■) = 0.5 on the E side by the No Correlation 
Lemma. 

Assembling the conclusions el), e2), and e3) from 
the two (strongly) asynchronous frames (e.g., in the 
Lorentz frame of the experiment since the outcomes can- 
not change according to the Lorentz frame) one obtains 
the expected triplet evaluation for the three probabili- 
ties: (0.15,0.15,0.5). Together with (**), this evaluation 
provides us with the impossible inequality 0.8 > 1. The 
rest of the theorem follows from checking the hypothe- 
ses that are used in the proof that we have given of the 
contradiction. This concludes the proof of the New Bell 
Theorem. 

Our New Bell Theorem admits the following immediate 
corollary that we will use as our conclusion: 

Corollary: It is the counter-natural character of the rea- 
soning, as permitted by any form of A, and not locality, 
that is the only problem common to all the versions of 
Bell Theorem, since one needs to assume some form of 
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A in any argument of Bell 's type. Thus Non-locality is 
not needed (in some circles, one would say that it can be 
disposed of using Occam's razor). 

D) Appendix: 

- al) For an analysis of the essential equivalence of all 
the Bis that use some locality asumptions, see ^l) and 

The probabilistic approach of Fine was later con- 
tinued by Pitowsky (see ^]| , [3 and references therein) 
who identified BI's as examples of inequalities going back 
to Boole [l^. This lets us understand Bis as resulting 
from a classicality assumption. 

- a2) Assuming locality a simple proof of (o) (that 
readily follows from the arguments of Bell in |4j) was 
given by Stapp (see equation (7) in ||). 

- a3) The inequality (**) is a special case of (2a) in 
^j , established there assuming locality (see also (4) in 

- a4) We could not have used (*) instead of (**), but 
we could have used the following weaker but new form of 
BI: 

min[max(7r(^ = £,•), t(7>< = £'M > 1 ~ ^ - £i) ■ 

- a5) If no measurement is made on the E side, one can 
still prove (o). We start as in the case when one measures 
£ . Like in that case (c/. the main text) the only thing 
that could generate inequalities is the difference in the 
orientations of the angles (| — ])) an d (I ~^),— I ?))• 
But now, £ and £' have symmetrical roles because none 
of the sequences gets to be measured. Both of these se- 
quences are now inferred to have definite value by an 
E-P observer on the basis of Q1VL4. Reversing the order 
of the elements of the second pair one would find | — >) 
in the role payed by | f) in the first pair, from which the 
equality follows immediately for an E-P observer assum- 
ing isotropy (no need of parity invariancc in this case): 
this provides us with e3). The proof of e2) is unchanged 
and el) is proved like e2). We thus recover a BT when 
no measurement is made on the E side. 

- a6) Three proofs of EACP^locality. Let us not as- 
sume locality. Then in the setting for a BI presented in 
part B), the EACP would still allow a P-E observer to 
make sense of it {Pi = £i) and of it {Pi = £|) as was done 
in the proof of the New Bell Theorem to justify el) and 
e2) and, in the same way, an E-P observer to make sense 
of w(£i — Pi) and of ir{£i — P[). However one cannot 
extract 7r(£| = V'A from augmenting QM by A with- 
out making a more precise assumption, to the contrary 
of what happens if on assumes locality. This concludes a 
first proof that EACP^locality. To get another proof, no- 
tice that if one assumes non-locality in its strongest form 



together with EACP, one gets that tt(£[ = P[) = 0.5 
(meaning statistical independence) instead of the 0.85 
that was obtained in section B) under the usual locality 
assumption, enough though, since 0.5 > 0.45, to get a 
BT. To get a third proof, notice that by violating the 
EACP when measurements are made, one would (easily) 
enable super-luminal signaling, to the contrary of what 
happens with any locality violation in view of [8j . Details 
are left to the reader. 



Remark: If one uses a broader and more common defi- 
nition of locality than the one that we have chosen, things 
get even simpler since, as Richard Frieberg puts it "the 
difference between EACP and locality is that locality in 
a relativistic context means (in its usual broader sense) 
that no influence at all can be exerted across spacelike 
separation" . 
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